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SECTION A 


1 Databases 


(a) 


(6) 


A relational algebra is defined over sets of tuples. Explain whether the relational 
union of two sets (relations) requires the schemas to share attribute names. Does 
the same consideration apply for intersection? [4 marks] 


In the same relational algebra, sets with P and Q records (table lengths) are 
joined by a binary operator. What is the minimum and maximum number of 
records in the answer if the operation is union? What if it is a natural join? 

[4 marks] 


A relational database of text books holds a small amount of information about 
each chapter of each book. Two relations (tables) are used with a total of seven 
distinct attribute names (fields). Draw a suitable E/R diagram and define the 
two corresponding relational schemas. Say what forms of key are present. 

[7 marks] 


The textbooks all relate to a common subject area and a more detailed database 
is now required, perhaps storing the books themselves. What new operations 
might be wanted from this new system, how would you structure it and would 
there be benefits from enforcing consistency rules? [5 marks] 
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2 Databases 


(a) 


(b) 


It is reckoned to be impossible to simultaneously provide the CAP trio in DBMS 
design: these are. G=-s+==-4= Vi ieee eee Vy oiid: P=sssesoe tassS-o5 ei 
Complete and define these terms and explain why. [4 marks] 


What is one significant difference between a reflexive relation in discrete maths 
and a database relation between a domain and itself? Give simple examples of 
both. Can either form of relation usefully have a one-to-one cardinality? 

[6 marks] 


An rDMBS holds tables with these four schemas: 
R1:(A, B, C, D), R2:(A, B,C), R3:(A, B, D, E, F) and R4:(A, D). 


You are told that values of F are always predictable from values of E, but it 

might be costly to make that prediction. Also, database updates might be much 

rarer than reads. What rearrangment of the schemas might be good and why”? 
[5 marks] 


An is_a relation between entity types is defining a two-level hierarchy: for 
instance, lions and whales are both mammals. There are three basically different 
ways that a two-level is_a hierarchy can be modelled using relational tables. 
What are they? Multi-level is_a relationships commonly arise: which way then 
might be best? [5 marks] 
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SECTION B 


3 Introduction to Graphics 


(a) Select the most suitable colour space for the following applications. Justify your 


choice. 
(2) Color picker in painting software. 2 marks 


(7c) An error metric for evaluating colour distortions introduced by a camera. 
2 marks 


(27) Storing high dynamic range textures for real-time rendering. 3 marks 


The diagram below shows a projection of the colour gamut and the primaries of 
two popular RGB colour spaces. 


— [—S—1Tu-R BT.709 
a —o— ITU-R BT.2020 
ra SS. 


(1) BT.709 primaries are rather far from the boundary of the visible colour 
gamut. What motivated that choice of primaries in BT.709? [3 marks] 


(77) Most modern displays do not reproduce all colours from the BT.2020 colour 
gamut. What motivated that choice of the colour gamut? [3 marks] 


You design a colour management system for a new display with atypical primary 
colours. You measured the spectra of its three primaries and stored measured 
samples in an Nx3 matrix D, in which the columns correspond to the three 
primaries and the rows to the spectral samples. Find a transformation matrix 
from BT.709 RGB to the values of the three native primaries of the display 
that provide a metameric match. You have an N x3 matrix Sxyyz with the XYZ 
colour matching functions and a 3x3 matrix Mzo9_,xyz to transform from RGB 
BT.709 to XYZ. Both input and output colour spaces are linear. You can ignore 
normalization constants. (7 marks] 
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4 Introduction to Graphics 


Your task is to design a rendering pipeline for a next-generation augmented reality 
headset. 


(a) Explain how you could improve the quality of rendering given the following 


hardware limitations. Keep your answers short, up to 100 words. 


(1) The high refresh rate display will be controlled by 6 instead of 8 bits for 
each colour channel. [3 marks] 


(7) The display resolution will be 1024x1024, resulting in a relatively large 
pixel size when magnified by the lens of the headset [4 marks] 


(itt) The headset is equipped with an optical see-through display in which the 
light from the environment is mixed (in an additive manner) with the light 
from the display. The headset has a sensor that can measure the overall 
amount of environment luminance reaching the display. [5 marks] 


The tracking sensors provide the position of the estimated centre of the left 
eye, ep, in the units of millimetres and the view direction, 0, both in the 
world coordinates. The distance between the two eyes in millimetres is djop. 
The headset uses a right-handed coordinate system, and the up vector is 
u = [0 1 0]. Derive view matrices for the left and the right eye cameras. 
Illustrate your answer with a diagram. [8 marks] 
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SECTION C 
5 Interaction Design 


ChatGPT is a natural language processing tool, developed by OpenAI, driven by AI 
technology that allows you to have human-like conversations with a chatbot. It has 
a Free ChatGPT Unlimited version, whose main screen is shown in the figure above, 
that is a free-to-use AI system. Based on a large language model, ChatGPT can 
answer questions and assist its users with tasks, such as composing emails and code, 
and enable users to refine and steer a conversation towards a desired length, format, 
style, level of detail, and language. 


(a) Identify four (different) groups of stakeholders of the Free ChatGPT Unlimited 
tool and provide a short description as to why they constitute a stakeholder 
group for this tool. 


[4 marks] 


(b) Nielsen’s heuristics used for Heuristic Evaluation are: (1) visibility of system 
status, (2) match between system and real world, (3) user control and freedom, 
(4) consistency and standards, (5) error prevention, (6) recognition rather than 
recall, (7) flexibility and efficiency of use, (8) aesthetic and minimalist design, (9) 
help users recognize and recover from errors, and (10) help and documentation. 


Using Figures 1-6, undertake a Heuristic Evaluation of the Free ChatGPT 
Unlimited. 


S New chat G ChatGPT 3.5 ~ 


Happy Memories Explained 


Ss) 


How can | help you today? 


Suggest some names Recommend a dish 
for my cafe-by-day, bar-by-night business to bring to a potluck 
Plan atrip Create a content calendar 
to see the northern lights in Norway for a TikTok account 
+ Upgrade plan 
ti GPT DALL-E, and more 
Message ChatGPT... 


© Hatice Gunes 


ChatGPT can make mistakes. Consider checking important information. 


Fig. 1. First / Main screen 
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(te 


S New chat G ChatGPT 3.5 v 


@ You 
Today 


Can you recommend me some appetizer options to bring to a potluck? 
Potluck Appetizer Recomn + © 


Happy Memories Explained 
© ChataPt 


Certainly! Here are some delicious appetizer options that are great for 


potlucks: 


i. Caprese Skewers: Skewer cherry tomatoes, fresh mozzarella balls, and 
basil leaves. Drizzle with balsamic glaze and sprinkle with salt and pepper. 
2. Stuffed Mini Peppers: Halve mini sweet peppers, remove seeds, and stuff 


with a mixture of cream cheese Vv tbs, and spices. Bake until peppers are 


, Upgrade plan tender. 
Get GPT-4, DALL-E, and more 


© Hatice Gunes 


IMessage ChatGPT... 


ChatGPT can make mistakes. Consider checking important information. 


Fig. 2. (starting from the main screen) Asking for appetizer options to bring to 
a potluck 


Settings x 


System Vv 


® General 


Multi-factor authentication Enabl 
‘I! Speech nee 
Require an extra security challenge when logging in. If you are 
unable to pass this challenge, you will have the option to 


€ Data controls recover your account via email. 


Always show code when using data analyst { @) 


Archived chats Manage 


Archive all chats Archive all 


Delete all chats Delete all 


Fig. 3. Clicking on user name and viewing and/or modifying Settings 
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Clear your chat history - are you sure? 


Cancel Confirm deletion 


Fig. 4. Clicking on user name. Viewing and/or modifying Settings — Click 
Delete All Chats 


Customize ChatGPT 


Custom Instructions © 


What would you like ChatGPT to know about you to provide better 
responses? 


Enable for new chats t @) 


Fig. 5. Clicking on user name and clicking on Custom Instructions 
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S New chat G ChatGPT 3.5 ~ ALN 
Today 
© You 
Paulie pstben eee 7 9 (5) can you provide me an image of a dog? 
Happy Memories Explained 
© cChatGPT 
| can't provide images directly, but you can easily find images of dogs by 
searching online using a search engine like Google Images or a stock photo 
website. Just enter "dog" into the search bar, and you'll find plenty of 
adorable dog pictures to enjoy! 
Upgrade plan 
+ t , DALLE 
Message ChatGPT... 
© Hatice Gunes 
ChatGPT can make mistakes. Consider checking important information. 
Fig. 6. (starting from the main screen) Asking for an image of a dog 
[5 marks] 


You are working as a member of the Free ChatGPT Unlimited design team, and 
the management team has asked you to identify whether the current interface 
needs improvement and why. You decided to conduct interviews with the 
four groups of stakeholders you have identified in (a). Provide two questions 
for each stakeholder group to understand whether the current interface needs 
improvement or not, and why. [8 marks] 


Describe how you will analyse the interview data gathered in (c), and what 
your next steps will be to improve the validity and reliability of your findings to 
support the notion that the current interface needs improvement (or not). 

[3 marks] 
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6 Interaction Design 


Financial literacy is the ability to understand and effectively use various financial 
skills, including personal financial management, budgeting, and investing. Low 
financial literacy threatens the well-being of individuals and families, especially in 
underserved and low-income communities. Without a solid financial foundation, 
young people are more susceptible to predatory lending and costly errors in managing 
debts and expenses that can lead to lifelong financial inequity. Your company has 
been tasked with creating an Android and iOS app to help young people with financial 
literacy. 


(a) Think about the interactive product you are designing, and identify and describe 
with rationale three key requirements that the application must meet. 


[3 marks] 
(b) Sketch out a design for this app with relevant screens and details, illustrating 
how it meets the set of requirements you have listed in (a) by providing labels 

and explanatory captions, and descriptions of the interactive aspects. 


[5 marks] 


(c) Create two user personas for this app and describe how you can use these to 
guide your design choices. [6 marks] 


(d) Describe how you will utilise and apply the Gestalt principles to the design you 
created in (b) to enhance the user experience. 


[6 marks] 
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SECTION D 


7 Machine Learning and Real-world Data 


A language school for English receives students from different countries and with 
different skill levels. Before entering, students perform an English test, which decides 
which class they are assigned to (B1, B2 or I). After studying for a week, students 
are sometimes reassigned to a different level better reflecting their actual language 
ability. 


(a) 


Professor M is unhappy with this process and the test’s ability to predict student 
level. She suggests that the school should derive students’ final level directly by 
machine learning, based on the students’ age, their first language (L1), and how 
long they studied English before. Several years’ data from previous students is 
available. Describe how the task could be accomplished using a Naive Bayesian 
Classifier. Apply smoothing if this is appropriate. Give all relevant formulae for 
parameter estimation and classification. [4 marks] 


Calculate all relevant probabilities for features age, L1 and experience, for your 


classifier defined in a), using the following sample of student data. . [6 marks] 
Test performance Student Stats Level 
ID | Total Question Assignment 
Score 1 2 3 4 5 6 7 8 9 Age Ll Exper. | Initial Final 
A 4 ee e e | [21-24 C 5-7 B2 B2 
B 5 ° eo ee e 13-16 F 1-4 B2 B2 
C 1 e 21-24 F 1-4 Bl I 
D 3 ee e | {17-20 C 5-7 Bl B2 
E 7 e@ @ @ ° ° e 13-16 C 5-7 I B2 
F 5 e@ ee e ee 21-24 F [> 8 B2 I 
G 6 e e e ° ee 17-20 C 5-7 I I 
H 2 ° e 17-20 C 1-4 Bl Bl 
I 8 e@© © e© @ @ e e| [13-16 F [> 8 I I 
J 5 @ e@ ° e e 21-24 F 1-4 B2 Bl 


A new student enters the school whose L1 is “C”, who has studied English for 5 
years, and who is 18 years old. Which level will this student be assigned to by 
your classifier from a), as trained in b), and why? [2 marks] 


Some features influence the prediction of the classifier more than others. How 
could you use the data available to you to determine the relative relevance 
of individual features? Describe at least two methods and give a numerical 
illustration for at least one of your methods, using the above data. [4 marks] 


The school are now re-thinking how they create classes. Rather than relying on 
level descriptions such as B1, they want to define classes by grouping incoming 
students of similar ability into classes of roughly the same size. Describe a 
method how this could this be achieved. You may use all information in the 
table above, excluding the information on levels. [4 marks] 
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8 Machine Learning and Real-world Data 


You are interested in predicting snowfall in order to plan for a winter trip with friends. 
You know that for snowfall to occur, we need to have freezing air temperatures, even 
if this is not always sufficient. 


You decide to model this using a first order hidden Markov model (HMM), with air 
temperature as the hidden state (Freezing or NotFreezing) and Snowfall, Rain or Dry 
as the observations. Then you take a look at data from recent months and see: 


Month Oct Nov Dec Jan Feb Mar Apr May Jun 
AirTemp NF NF F Fe F NF FE NF NF 
Weather R R S) S) D R D D D 


(a) Define and estimate the components of an appropriate HMM for this application, 
without smoothing. Assume that all hidden states are equally likely to start the 
sequence. Ignore the start and end states. 


[4 marks] 


(b) Answer the following questions using the model you estimated. Provide the 
calculations needed to arrive in each answer: 


(2) What are is most likely combination of air temperature and weather for the 
next month (July) ? 


(7) What is the most likely combination of air temperature and weather in 
three months from now (September)? 


[4 marks] 


(c) We are in June, and your friends predict that July and August will be dry, 
followed by rain in September and snow in October. If they are right, what is 
the most likely sequence of air temperatures for July to October? 


[6 marks] 


(d) The current model you have estimated ignores the time of the year we are in, 
e.g., the probability of transitioning from NotFreezing to Freezing is independent 
of whether we are in summer or in winter. Build a better model by taking 
into account the time of the year, by changing the definition of hidden states. 
Describe any transformations necessary to the data and estimate the parameters 
of the new model. 


[6 marks] 
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9 Machine Learning and Real-world Data 


You are a football academy manager and you want to design a classifier for deciding 
which players to recruit to your academy. For this purpose you gather recent data 
(2020 and onwards) on players about to finish high school. You construct the 
following table, where each row represents a potential recruit. The column Success 
indicates whether the player was considered a successful recruit or not; it is the label 
we are trying to predict. 


Success Goals Position Gender 


Y Many Attack M 
N None Goalkeeper FE 
Y Few Defender M 
Y Few Attack M 
N None Defender M 
N Few Defender F 


You want to develop a model to help you in deciding whether you should recruit a 
player based on their features. You decide to use a Naive Bayes Classifier. 


(a) Define and estimate the parameters of the Naive Bayes Classifier. [4 marks] 


(b) There is a suspicion that the model you estimated is biased. If a model is 
biased, it treats certain groups of the population unfairly, i.e. it never predicts 
their members to be successful. Identify two such groups and demonstrate the 
problem by constructing appropriate test instances. [4 marks] 


(c) Explain which property of the classifier you developed in part (b) enables you 
to make this judgement based on the parameters you have estimated. Give one 
reason why it is a useful property for the model to have and one reason why this 
is a problematic property. [2 marks] 


(d) You are given more data from an earlier time period, pre-2020. How would you 
incorporate it in your experimental setup for the classifier you developed in part 
(a)? [4 marks] 


(e) The model you developed predicts success or not based on a snapshot of the 
potential recruits at a single point in time. However you are recruiting for an 


academy, so you want to take into account the trajectory of the players over a 
number of years. Propose a modelling solution for this. [6 marks] 


END OF PAPER 
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